A Comparison of Similarity Techniques for Detecting Source Code Plagiarism

نویسنده

  • Bradley Beth
چکیده

Academic dishonesty is a universal problem. Detecting duplicated text among natural language artifacts is a welldocumented task. However, performing similar analysis on source code presents unique problems. In this paper, I present a comparison of the application of various techniques in textual similarity processing on source code. Beyond this, I investigate the application of textual similarity algorithms on the Abstract Syntax Tree (AST), Control Flow Graph (CFG), and Intermediate Representation (IR) produced by the LLVM compiler toolchain. Aggregate similarity scores of a variety of methods compare favorably against the current state-of-the-art source code plagiarism tool.

منابع مشابه

COAT: Code ObfuscAtion Tool to evaluate the performance of code plagiarism detection tools

There exist many plagiarism detection tools to uncover plagiarized codes by analyzing the similarity of source codes. To measure how reliable those plagiarism detection tools are, we developed a tool named Code ObfuscAtion Tool (COAT) that takes a program source code as input and produces another source code that is exactly equivalent to the input source code in their functional behaviors but w...

متن کامل

Detecting Functional Similarity between Java Files Using Metrics

Two similar pieces of software code is called clones. Software Developers often copy a section of code, and then paste it with or without modification .Software clone detection is employed to lessen the software maintenance cost and to improve understandability of the system. It also helps in plagiarism detection. Many code clone detection techniques exist and they detect and identify various t...

متن کامل

An Abstract Method Linearization for Detecting Source Code Plagiarism in Object-Oriented Environment

Despite the fact that plagiarizing source code is a trivial task for most CS students, detecting such unethical behavior requires a considerable amount of effort. Thus, several plagiarism detection systems were developed to handle such issue. This paper extends Karnalim’s work, a low-level approach for detecting Java source code plagiarism, by incorporating abstract method linearization. Such e...

متن کامل

A comparison of plagiarism detection tools

In this paper we compare five tools for detecting plagiarism in source code texts: JPlag, Marble, moss, Plaggie, and sim. The tools are compared with respect to their features and performance. For the performance comparison we carried out two experiments: to compare the sensitivity of the tools for different plagiarism techniques we have applied the tools to a set of intentionally plagiarised p...

متن کامل

A Source Code Similarity System for Plagiarism Detection

Source code plagiarism is an easy to do task, but very difficult to detect without proper tool support. Various source code similarity detection systems have been developed to help detect source code plagiarism. Those systems need to recognize a number of lexical and structural source code modifications. For example, by some structural modifications (e.g. modification of control structures, mod...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014